Uniqueness Analysis of Non-Unitary Matrix Joint 
Diagonalization ^ 

Martin Kleinsteuber and Hao Shen 

April 5, 2012 

Abstract 

Matrix Joint Diagonalization (MJD) is a powerful approach for solv- 
ing the Blind Source Separation (BSS) problem. It relies on the construc- 
tion of matrices which are diagonalized by the unknown demixing ma- 
trix. Their joint diagonalizer serves as a correct estimate of this demix- 
ing matrix only if it is uniquely determined. Thus, a critical question 
is under what conditions a joint diagonalizer is unique. In the present 
^SJ work we fully answer this question about the identifiability of MJD based 

, ^ BSS approaches and provide a general result on uniqueness conditions 

^ I of matrix joint diagonalization. It unifies all existing results which ex- 

ploit the concepts of non- circularity, non-stationarity, non-whiteness, and 
non-Gaussiamty. As a corollary, we propose a solution for complex BSS, 
\l which can be formulated in a closed form in terms of an eigenvalue and a 

singular value decomposition of two matrices. 

H 

Index Terms 

^ Non-unitary joint diagonalization, uniqueness analysis. Complex Blind Source 

I— —I Separation (BSS), Second-Order Statistics (SOS), Higher-Order Statistics 

(HOS). 

> 

^ 1 Introduction 

O 

— Joint diagonalization of a set of square matrices is a prominent algorithmic 

' paradigm for solving the Blind Source Separation (BSS) problem jlj. One crit- 

I ical task of the Matrix Joint Diagonalization (MJD) approach is to construct 

^-H a set of square matrices, so that there exists a unique joint diagonalizer, which 

T—i serves as a correct demixing matrix of the BSS problem. Although uniqueness 

^ and solvability conditions of the BSS problem are well known in the framework 

• 'H of Independent Component Analysis (ICA), cf. [2], practical identifiability of 

MJD based BSS approaches has not been systematically studied yet and is 
^ investigated thoroughly in this work. 

By imposing the assumption that source signals are mutually statistically 
independent, it is well known that source signals can only be extracted up to 
arbitrary scaling and permutation. The statistical independence assumption 
leads to the celebrated technique of Independent Component Analysis (ICA) 
[2]. One fundamental question is: Under what conditions on the sources can the 
mixing process be uniquely identified up to permutation and scaling? General 
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identifiability results of the ICA problem have been developed based on either 
the Darmois-Skitovitch theorem, cf. [Hll], or diagonalization of the Hessian of 
the characteristic function, cf. [4]. They provide a theoretic ground for develop- 
ing ICA algorithms that minimize the so-called contrast functions, cf. [5j El [7] . 
Many popular ICA contrast functions originate from information theory, such 
as the mutual information or the differential entropy 9J . Unfortunately, per- 
formance of these contrast function based approaches depends significantly on 
a correct estimation of the distribution of the sources, which is often an infeasi- 
ble undertaking in application scenarios. Although there have been alternative 
non-parametric approaches developed to cope with this difficulty, cf. [101 E] j 
these methods go along with a high computational burden. 

One simple approach to overcome the aforementioned difficulty is to utilize 
some additional properties of the sources for the separation task. Although the 
standard ICA model does not make any assumption on the temporal structure of 
the sources, temporal information is richly available in many real applications, 
and has been extensively exploited in developing efficient ICA algorithms, cf. 
[m HHj . Specifically, these approaches utilize only selected second-order statis- 
tics (SOS) or higher-order statistics (HOS) of the observations, and often result 
in a form of a joint diagonalization problem of a finite number of matrices. 
These matrix joint diagonalization based methods are known as the tensorial 
BSS approaches. In parallel to the general ICA theory, the present work aims 
to answer the following critical question: Under what conditions on the matri- 
ces, which are constructed for joint diagonalization, can we identify the mixing 
process uniquely up to permutation and scaling? 

Early works on Matrix Joint Diagonalization in BSS are restricted to unitary 
transformations, as a whitening process on the observed mixtures is often used as 
a preprocessing step, cf. [14 . However, it has been shown in [TS] that linear BSS 
via a Unitary Joint Diagonalization (UJD) approach may have a serious limit 
of degraded performance in the presence of additive noise. In particular, many 
criterions for joint diagonalization can be significantly distorted by the whitening 
process, cf. [HI [17]. To avoid such a limit of UJD, a natural generalization of 
UJD, known as Non-Unitary Joint Diagonalization (NUJD), has been proposed 
and studied with dramatically increasing attention, cf . [T5J UHl HD] ■ 

Non-stationarity is a common property, which describes the temporal struc- 
ture of a signal. One simple assumption that can be employed for BSS in this 
context is that the covariance matrix of the sources varies over time. By ex- 
ploiting this property, the signals can be separated via a joint diagonalization 
of a finite set of covariance matrices within different time intervals [31] . Identi- 
fiability conditions of this approach are developed in [Tl] for the UJD case, and 
in [32] under a limited NUJD setting, where only the real valued ICA problems 
are considered. Similar approaches employ also cyclo-stationarity of the sources 
[23l [24] , or time-frequency distributions at different time frequency points [25] . 

Another simple temporal concept used in BSS is the non-whiteness of sources. 
Pioneering works in [3S1 [37] show that real valued source signals with distinct 
spectral density functions are blindly identifiable by using only the autocorre- 
lation of the observations. Similar results in [28] show that stationary colored 
complex signals can be blindly separated by using a set of autocorrelation ma- 
trices. When source signals are both stationary and white, it requires more 
knowledge about the signals, such as higher-order statistics [2]. In particular, 
third- and fourth-order cumulants have been used and demonstrate their suc- 
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cess in solving BSS problems, cf. [SSI [301 [31] • In practice, higher order statistics 
are often rearranged in matrix form, so that the matrix joint diagonalization 
approach is applicable. 

All the aforementioned statistical properties can be used for separating both 
real- and complex- valued signals. If the sources are second-order non-circular 
and the values of the circularity coefficients are distinct, complex BSS can be 
solved effectively by a joint diagonalization of only one covariance matrix and 
one pseudo-covariance matrix, cf. [32j . The corresponding method is known as 
Strong Uncorrelating Transform (SUT) |33j . Unfortunately, a solution given by 
SUT does not in general yield a correct demixing of the sources in real applica- 
tions, where noise is commonly present. Recently, generalized SUT approaches 
have been proposed independently in [33| and [3S] to jointly diagonalize both co- 
variance and pseudo-covariance matrices. In particular, the work in |35| demon- 
strates that in the presence of noise, this generalized approach outperforms the 
state-of-the-art MJD approaches in terms of recovery quality. 

To summarize, rich literature is available in the community on developing 
the matrix joint diagonalization based BSS methods. Existing identifiability 
results are mainly focused on the SOS based approaches. However, identifiability 
analysis for the HOS methods has not been addressed systematically. In this 
work, we derive the uniqueness conditions of the NUJD setting. It leads us to 
the most general result so far on identifiability conditions for the HOS based BSS 
methods, and an algebraic solution, i.e. a solution that only involves Eigenvalue 
Decompositions (EVD) or Singular Value Decompositions (SVD). Furthermore, 
it also provides a rigorous analysis on the convergence properties of existing 
iterative algorithms [36]. This is due to the fact that isolated critical points of 
functions can be identified which measure the degree of joint diagonality. This 
issue is not discussed further in this paper and is subject matter of ongoing work 
of the authors. 

The paper is organized as follows. Section [2] gives a setting of the complex 
BSS problem and motivates the non-unitray joint diagonalization approach as 
a solution to BSS. In Section[3]we derive necessary and sufficient conditions for 
the uniqueness of non-unitary joint diagonalization. In Section[4]this uniqueness 
result is used to analyze the identifiability of tensorial BSS methods and to 
propose a new algebraic solution which generalizes the SUT approach and is 
able to separate non-circular signals with indistinct circularity coefficients. 

2 Complex BSS and Matrix Joint Diagonaliza- 
tion 

In this section we review the complex linear BSS problem to make this work 
self-contained, together with several second- and higher-order statistics based 
BSS approaches. Thereafter, we introduce a non-unitary joint diagonalization 
approach which is general enough to unify all the existing approaches in the 
literature. 

2.1 Notations 

We denote by (•)^ the matrix transpose, by {■)^ the Hermitian transpose, and by 
(•)* the (entry-wise) complex conjugate. Furthermore, \z\ = \J zz* , Jftz and 
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denotes the modulus, the real part and the imaginary part of z S C, respectively. 
The complex unit is denoted by i :— V—l- {-Y stands optionally for either the 
matrix transpose or the Hermitian transpose. Matrices are denoted with capital 
Roman and Greek letters, e.g. A, fi. Vectors are in lower case bold face, e.g. 
SjW. The expectation value of a random variable is denoted by E[-]. 

By Gl{m) we denote the set of all invertible (m x m)-matrices. Im is the 
(m X m)-identity matrix, and the sets of all unitary and real orthogonal (m x m)- 
matrices are defined as 

U{m) := {X (^Gl{m)\X^X = Ira} and (1) 
0(m) J7(TO)nE"^"\ (2) 

respectively. The set of all complex orthogonal (m x m)-matrices is given by 

0(to, C) := {X e Gl{m)\X'^X = /„,}. (3) 

For C G and X E Gl{m), we define the linear transformations 

C ^XCX^, (4a) 
C^XCX", (4b) 

as the transpose congruence transform and Hermitian congruence transform, 
respectively. Finally, we denote by © the exclusive disjunction operator. 



2.2 Properties of Complex Signals 

In this work we model a complex signal s{t) = x{t)+iy(t) as a complex stochastic 
process indexed by the variable t with real x{t) and y{t). Let [s(ti), . . . , s{tn)]^ 
be an n-dimensional induced random vector of the signal s(t). 



2.2.1 Stationarity 

A signal s{t) is said to be completely stationary if the joint probability distri- 
bution of [s{ti), . . . ,s{tn)]^ is identical to the joint probability distribution of 
[s(ti — r), . . . , s{tn — t)]^ for any n, cf. 37 . A real signal x{t) is said to be 
weakly stationary, if the following holds: 

(i) E[x{t)] = E[x{t + t)] for aU t G M and 

(ii) E[x(ti)x(i2)] = E[x{ti + T)x{t2 + r)]. 

The first property states that the mean of the signal is constant, and the second 
that the correlation only depends on the time difference ti — 12- 



2.2.2 Circularity 

A complex signal s{t) = x{t) + iy{t) is said to be (weakly) circular, if s{t) 
and e'"s(t) have the same probability distribution. The circularity assumption 
implies E[s(t)2] = e2'"E[s(t)2] for aU a, i.e. E[s(t)2] = 0. Given a complex 
signal s{t) with a bounded variance, i.e. E[|s(t)p] < oo, the following quantity 

_ \E[sm\ 
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is referred to as the circularity coefficient of s{t). The definition of circularity 
can be extended straightforwardly to the case of multiple signals. Let s{t) G C" 
be a vector consisting of m signals. Then s{t) is circular if s(t) and e'"s(t) have 
the same probability distribution. 

A signal s{t) is said to be completely circular if the induced random vector 
[s(ti), . . . , s{tn)]^ is circular for all n G N. A signal is circular of order n if the 
induced vectors of order lower or equal to n are circular, cf. [38) . 

2.2.3 Whiteness 

A real signal x{t) is said to be white if 

(i) E[x{t)] = 0; 

(ii) E[x{ti)x{t2)]=cS{ti-t2), 

where S is the Kronecker delta function and c some positive constant. We refer 
to [39] for generalization of the concept of whiteness to higher order. 

2.3 Complex Linear BSS Model 

Let s{t) — [si(t), . . . ,Sm{t)Y be an m-dimensional mutually statistically inde- 
pendent complex signal. The noise-free instantaneous linear complex BSS model 
is given by 

w{t) = As{t), (6) 

where A G Gl{m) is the mixing matrix and 'w{t) = [wi{t), . . . , Wfn{t)]^ presents 
m observed linear mixtures of s{t). Without loss of generality, we assume that 
the sources s{t) have zero mean, i.e. E[s(i)] = 0, cf. il^. 

The task of the linear complex BSS problem ^ is to recover the source 
signals s(t) by estimating the mixing matrix A or its inverse A~^ only based on 
the observations w(t) via the demixing model 

y{t) = X^'wit), (7) 

where E Gl(m) is the demixing matrix, which is an estimation of A^^ , and 
y{t) represents the corresponding extracted signals. The statistical indepen- 
dence assumption provides various statistical properties of sources to identify 
the demixing matrix. The widely used properties include non- circularity, non- 
stationarity, non-whiteness, and non-Gaussianity. 

2.4 Second-Order Statistics Based ICA Approaches 

In this subsection, we briefly review the second-order statistics based ICA ap- 
proaches and motivate our general approach of joint diagonalization. 

Given the mixing model ([6]), the covariance matrix of the observations w{t) 
is computed as 

C^{t) := E[w{t)w^{t)] = AE[s(t)s^(t)] A^ (8) 

^ ^ ^ 

=:C.(t) 

where the covariance matrix of the sources Cs(t) is diagonal and non-negative 
following the statistical independence assumption. When the source signals are 
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assumed to be non-stationary, i.e. C^(ii) ^ Cw{tj) for ti ^ tj, the demix- 
ing matrix is expected to be identifiable via a joint diagonalization of a set of 
covariance matrices at different times. 

In order to separate stationary but non-white signals, one possibility is to use 
the non-zero autocorrelations at different time instances ti and ^2 with ti ^ t2, 
namely 

CUh,t2) ■.= E[w{h)w''{h)] = ACs{h,h)A". (9) 

Note that, although the autocorrelation matrix Cs{ti,t2) of the sources is still 
diagonal, it is not real in general. In other words, the autocorrelation matrix 
of the observations is generally not a Hermitian matrix and consequently not 
positive definite either. Similarly as above, the demixing matrix is expected to 
be identifiable via a joint diagonalization of a set of autocorrelation matrices 
with different time pairs. 

If the signals have a non-trivial imaginary part, additional properties can be 
employed for BSS. Besides the standard covariance matrix Q, a similar statis- 
tical quantity of complex valued signals, known as pseudo-covariance matrix, is 
defined as 

R^{t) := E[w{t)w'^{t] = ARs{t)A^. (10) 

The works in [321 133j have shown that, if the sources are all non-circular with 
distinct circularity coefficients ([s]), i.e. distinct diagonal entries of Rs{t), the 
demixing matrix can be successfully identified by jointly diagonalizing both the 
covariance and the pseudo-covariance matrix. The resulting algebraic solution, 
referred to as Strong Uncorrelating Transform, provides a simple answer to 
the complex BSS problem. However, it fails in separating non-circular signals 
with same circularity coefficients. To overcome this problem, one can either 
utilize iterative contrast function based algorithms or employ some additional 
information, as for example the pseudo-autocorrelation matrix of the signals, 
which is defined as 

R^itiM) ■.= ¥.[w{t^)w^ {t^)] = ARs{t^,t2)A' . (11) 

Note that both the pseudo-covariance and pseudo-autocorrelation matrix are 
complex symmetric. Recent work in [40^ considers the problem of jointly di- 
agonalizing a set of both auto-correlation and pseudo-autocorrelation matrices. 
The identifiability results for this approach are still lacking in the literature and 
follow from our main result in Section [31 

2.5 Higher-Order Statistics (Tensor) Based ICA Approaches 

In many real applications, second-order statistics may not be sufficient to ac- 
complish the task of separation. In these situations higher-order statistics can 
be exploited. For example, statistically independent non-Gaussian signals can 
be blindly separated by using the fourth-order J4J, or higher-order cumulants, 

cf. [mill]. 

Recalling the model as given in the fc-th order cumulant tensor of the 
sources s{t), denoted by ci*i,' G (C™)'', is defined with its (ii, . . . ,ife)-th entry 
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by 



=5:(-ir\p-i)!E[n4*\i)]-----iE[n4*^(^) 

p=l qeJi 



(12) 



q&Jp 



where t — G {0,1}'^ is a binary vector which enables or disables 

complex conjugate in each dimension, i.e. 



no complex conjugate, 

1 complex conjugate. 



(13) 



The summation in (12 1 involves all possible partitions {Ji, . . . , Jp} {1 < p < k) 
of the indices {ii,...,ifc}. We refer to [111133] for further details regarding 
higher-order cumulant tensors. 



Now, by varying two selected indices, say (i 



with all other indices 

(k) 

fixed, we obtain one cumulant matrix or cumulant slice of CgJ, denoted by 
{ci'^o){p,q} G C™^™. The assumption that sources are mutually statistically 
independent implies that all off-diagonal entries of the cumulant tensors of any 
order must be zero, i.e., the cumulant matrices (Ci^t){p are diagonal for all 
p ^ q. Multilinear properties of the cumulant tensors lead to 



{p,i} 



(14) 



where (•)' is determined by the construction (12). Similarly, by exploiting 



higher-order non-stationarity or higher-order circularity of the sources, the ICA 
problem is formulated as jointly diagonalizing a set of slices of the cumulant 
tensors via either Hermitian congruence or transpose congruence. Note, that 
up to date only specific cumulant tensors have been considered for BSS [H] 
via joint diagonalization, that end up with (•)^ being Hermitian conjugate in 
Equation (flil). 



In this work, we answer the question on the identifiability of the BSS prob- 
lem based on the joint diagonalization of a finite set of higher-order statistics 
matrices. 



2.6 A Unified NUJD Approach 

We summarize the above observations in a unified approach for non-unitary joint 
diagonalization. Let {Ci}f^i be a set of mxm complex matrices, constructed 

by 

C,^An,A^^, i = l,...n, (15) 

where fli = diag (wii, • ■ • , i^im) G and 17^ ^ 0. Note, that Equation 

( 15 1 allows mixtures of both Hermitian congruence and transpose congruence 
transformations. The task is to find a matrix X G Gl{m) such that the matrices 

{X"a(XH)t.|^ = l,...,n} (16) 

are simultaneously diagonalized. Our present work concentrates on developing 
the uniqueness conditions on the set of fi^'s, such that the matrix A is identifiable 
up to permutation and diagonal scaling. 
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Note that in the Hermitian congruence case, X^CiX, i — 1, . . . ,n, are diag- 
onal if and only if X simultaneously diagonalizes the Hermitian and the skew- 
Hermitian part of the Ci. Namely, by considering the real and the imaginary 



part of in Equation ( 15 1 that corresponds to Ci — AiliA^ separately, we can 



construct two Hermitian matrices as 

Cl = Am,A" and (17) 
= A^n.A^. (18) 

Therefore, without loss of generality, we study an equivalent formulation of 
Problem ( 15 ) by restricting Qi to be real diagonal whenever (•)^ is the Hermitian 



transpose. 

Clearly, the mixing matrix can only be identified up to permutation and 
scaling. We define the set of all column-wise permutated diagonal (m x m)- 
matrices by 

G{m) :— {DP I D E Gl{m) is diagonal and ^^^^ 
P is a permutation matrix}. 

Since Q (m) admits a matrix group structure, we can define the following equiv- 
alence class on C™""", cf. [44]. 

Definition 1 (Essential Equivalence). Let X,Y E Gl{m), then X is said to be 
essentially equivalent to Y , and vice versa, if there exists E e Q{m) such that 

X = YE. (20) 

Moreover, we say that the solution of a matrix equation is essentially unique, if 
the equation admits a unique solution on the set of equivalence classes. 

Since 

X^C,{X^)^^ = {X^A)Vt,{X^A)^^, (21) 

we assume without loss of generality for further studies that the Ci = VLi, 
i = 1, . . . ,n, are already diagonal. Thus, the identifiability analysis is restricted 
to investigating under what conditions the unit equivalence class G{m) admits 
the only solutions to the simultaneous diagonalization problem (|16|). 



3 Uniqueness of Non-Unitary Joint Diagonalzi- 
tion 

In this section we present the main results on the uniqueness analysis of the 



NUJD problem given by Equations ( 15 ) and ( 16 1. In contrast to existing results 



on joint diagonalization, we do not assume the matrices to be real as in [22j . 
positive definite as in [TS| , nor do we restrict the number of matrices to two as 
in [151 US] . For the sake of readability, we outsource the proofs of the results to 
the appendix. 

The identifiability results require a notion of coUinerarity for diagonal ma- 
trices. Let Zi, i = I, . . . ,n, denote n complex diagonal (m x TO)-matrices with 
the diagonal entries zn, . . . , Zim- For a fixed diagonal position fc, we denote by 
Zk := [zik, . . . , ZnkV ^ 'C" the vector consisting of the fc-th diagonal element of 
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each matrix, respectively. Recall that the cosine of the complex angle between 
two vectors v,w G C" is computed as 

c{v,w) := < IhllHIl ^ ^ ' (22) 

I 1 otherwise. 

where denotes the Euclidean norm of a vector v. We measure the coUinear- 
ity of a set of n complex diagonal (m x TO)-matrices by means of the complex 
angle of the vectors formed by stacking the entries at corresponding positions 
together. Precisely, the collincarity measure for the set of Zi's is defined by 

p(Zi, . . . , Z„) := max \c{zk,zi)\. (23) 

l<fc</<m 

Note, that < p < 1 and that p = 1 if and only if there exists a complex scalar 
u! and a pair z^, zi, k ^ I so that z^ = tozi . 

Our first result deals with the simple situations where either only purely 
Hermitian or purely complex symmetric matrices are involved. The techniques 
used for deriving the uniqueness conditions for both cases are adapted from the 
work in [22j . where only real symmetric matrices are considered. 

Theorem 1. (a) Let Sl^ G i£rnxm ^ j^^ z = 1, . . . ,n, he diagonal, and let X £ 
Gl{m) so that X^^liX* is diagonal as well. Then X is essentially unique if and 
only if p{VLi, . . . ,fJ„) < 1. 

(h) Let Q.i e M™><™^ for i — be diagonal, and let X G Gl{m) so 

that X^^iX is diagonal as well. Then X is essentially unique if and only if 
p(fii,...,f]„) < 1. 

The above theorem answers the identifiability question of complex BSS by 
means of matrix joint diagonalization approaches, when either purely complex 
symmetric, or purely Hermitian matrices are involved. For the situations with 
a mixture of Hermitian and complex symmetric matrices, we continue by firstly 
considering the case of simultaneously diagonalizing one complex symmetric and 
one Hermitian matrix. The following theorem generalizes a result in and 
[48] , wherein the authors require positive definiteness of the Hermitian matrix. 

Theorem 2. Let X e Gl{m) and let 51i he a complex and ^2 « Teal diagonal 
matrix such that 

X^UiX* and X^Vt2X are diagonal. (24) 
Then X is essentially unique if and only if 

|wifc||a;2i| 7^ |wii||w2/c|, (25) 

holds for all pairs (fc, I) with k ^ I. 

Finally, by considering a mixture of multiple Hermitian and complex sym- 
metric matrices, the following theorem completes our answer to the uniqueness 
analysis to the NUJD problem. 

Theorem 3. Let C, = X^VL.X* for i = 1, . . . , s and C'j = X^fl'^X for j = 
1, . . . , /i he diagonal. Moreover, let 

pin,,...,n,) = pin[,...,n'^)^i, (26) 
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then X is essentially unique if and only if there exists no pair (fc, I) with k ^ I, 
such that the following two conditions hold: 

(i) \ciu;k,^i)\^\ciuj'k,u;l)\ = l (27a) 

(ti) ||u;,||||a;n| -||u;^||||u;,'||. (27b) 



4 Applications to Complex BSS 

In this section, we firstly apply the uniqueness results from the previous sec- 
tion to the NUJD based complex BSS methods. The second application of the 
uniqueness results focuses on the development of algebraic solutions, i.e. solu- 
tions that only involve eigenvalue or singular value decompositions. Although 
the algebraic approaches are in general less powerful and less robust to noise 
and estimation errors than their iterative counterparts, cf. '35], these methods 
are of particular interest, as they provide simple, efficient solutions based on 
various powerful eigensolvers, cf. [iSl [5U] . 



4.1 Identifiability of Complex BSS 

From the main results developed in Section [3j any existing identifiability re- 
sult of complex BSS follows straightforwardly. However, to the best of the 
authors' knowledge there are no general results, which unify HOS based NUJD 
approaches. 

Let t :— [ti, . . . ,tm]^ G be a set of time instances for each observed 
signal Wi{t), we define the fc-th order auto-cumulant tensor of the observations 
w{t), cf. |43], denoted by Cw\{t), with its (ii, . . . , ife)-th entry 

fe(t)) cum(^«(tj ..... «;|;^(i.J), (28) 

Similarly as in Equation ( 14 ), the {p, q)-ih. slice of the k-th auto-cumulant tensor 
with a set of given time t is computed as 

fewV :-a(cSW), (29) 

The identifiability of the complex BSS problem via jointly diagonalizing a set 
of higher-order cumulant matrices is summarized as follows. 

Theorem 4 (Identifiability of Complex ICA). Given the complex linear BSS 
model as in ^ and a set of time instances ti :— [tn, . . . , tim]^ for i = 1, . . . , T, 
then the joint diagonalizer of the set 



^{P-l} ) k=2....,K 



(30) 

p,q—l....,m 

is essentially unique and solves the BSS problem up to permutation and scaling, 
if and only if the diagonal matrices 

Ci'IiU)) l,=i,...,T (31) 

{P^9}J k=2.,...,K 
p.g— l,...,m 

fulfill one of the following three conditions: 
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(i) p{S) < 1, where S denotes the set of cumulant matrices constructed via 
transpose congruence, i.e. 

'5:=|(cW(ti))^^^J^p®i,=o|; (32) 

(ii) pi'H) < 1, where "H is the set of cumulant matrices constructed via Her- 
mitian congruence, i.e. 

H := 



(cW(*i)) ip®i, = l\; (33) 



(Hi) When both the previous two conditions are violated, Equation (271 still 
holds. 

Example 1 (Fourth-Order Cumulants). Recall the complex BSS model the 
fourth-order cumulant of a subset of chosen sources (si^ , Si^ , Si^ , Si^) is computed 
explicitly as 



CUm(Si^ , Si2 > ; Su) = ^Si^{t)Si^{t)Si^{t)SiJ^t)] 

- E[s,,{t)sUt)ms4t)s,M 

- E[s,,{t).S,,itmS^2it)s^M 
-E[.S,,it).S,Mns^2(.t>^M■ 



(34) 



By taking into account all possible combinations of complex conjugate on each 
component, we have three different fourth- order cumulant tensors 

iC^ii)ii-i4 ■■^cnm{w^,,w,.,,w^^,WiJ, (35a) 
{^^12)^1-^4 ■^cnm{w*^,w^^,w^^,WiJ, (35b) 
iC^ljti-i4 ■^cnm{w*^,w*^,w^^,WiJ. (35c) 

Current works in the BSS literature only focus on the cases, where source signals 



are assumed to be harmonic, i.e. the quantity (35c) does not vanish, while the 
other two are equal to zero, cf. J^3f ^. Theorem 3 in {271 presents a result on the 
identifiability of separating harmonic sources using only the 4,-th order cumulants 



(35c). Certainly, when the sources are non-harmonic, i.e. all possible fourth- 



order cumulants (35) do not vanish, then the BSS problem can be still solvable 
via a joint diagonalization of fourth- order cumulant matrices, even though the 
conditions given in f27^ are violated. 



4.2 Algebraic Solutions to Complex BSS 

In this subsection, we investigate a particularly simple solution to the complex 
BSS problem. It is given in closed form in terms of an eigenvalue and a singular 
value decomposition of two matrices. We refer to such solutions as algebraic 
solutions. These methods are of high interest, since the existence of fast eigen- 
solvers turns them into very fast solvers for BSS. Algebraic solutions exist if 
either both matrices are complex symmetric or Hermitian, and at least one is 
invertible, cf. |15]. For the mixed case, the strong uncorrelating transform 
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(SUT) , where a combination of an eigenvalue decomposition and a Takagi fac- 
torization is used, provides an algebraic solution only if the Hermitian matrix 
is positive definite. In this subsection, we extend this approach and investigate 
the situation of separating non-circular signals with non-distinct circularity co- 
efficients, cf. [ST]. 

Lemma 1. Let Ci,C'2 G Gl{m) he one complex symmetric and one Hermitian 
matrix, respectively, constructed by 

Ci := AQiA^ and (36) 
C2:=A^]2A^ (37) 

where A G Gl{m), fii is complex diagonal, and ^2 is real diagonal. Let C2 = 
UYiU~^ he the Takagi factorization 0/C2. Then, 

(i) the matrix Ci := 'S~^^^U^CiUT,~^^^ admits a matrix factorization of the 
form Ci — VAV^ , where V S 0{m) and A is diagonal; 

(ii) the matrix X := UYi^^^'^V* satisfies 

X^C2X* = L and X^CiX is diagonal. (38) 

As the complex symmetric matrix C2 reflects the pseudo second-order statis- 
tics of complex signals, we name the matrix X Pseudo- Uncorrelating Transform 
(PUT) in referring its connection to SUT. A straightforward computation shows 
that the matrix V consists of the eigenvectors of CiCj , as 

CiCj ^VAV"V*AV^ = VA'^V^. (39) 

Thus, if is a matrix such that CiCj = WA'W^^ and if the eigenvalues 
A' are pairwise distinct, it follows by the uniqueness of the EVD, that V — 
W{W~^W)~^^^DP, where P is a permutation and D is diagonal with entries 
being ±1. Ultimately, we summarize the procedure for computing the PUT in 
Algorithm [l] 



Algorithm 1. Pseudo- Uncorrelating Transform (PUT) 

Step 1: Construct Ci,C2 from the observations w{t), 

where Ci and C2 are constructed via Hermitian 
congruence and matrix congruence, respectively; 

Step 2: Compute the Takagi factorization of C2 = C/St/^; 

Step 3: Let^Ci := j:-^/^U"CiUi:'^/^ , compute EVD of 
CiCj = WAW'^; 

Step 4: Compute V = W{W~^W)-^/^; 

Step 5: Compute the PUT matrix X = U^-^/^V*; 



Remark 1. When the matrix Ci is Hermitian and positive definite, i.e. Ci 
heing the covariance matrix of the observations, then the entries of A in (39) 
are simply the reciprocal of the circularity coefficients of sources. Our result 
coincides with the identifiahility condition of SUT, cf. theorem 2 in 
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Remark 2. The second observation is that the SUT of an arbitrary pair of 
one positive definite Hermitian and one complex symmetric matrix does always 
exist, cf. \48^ - In contrast, the existence of the PUT matrix is not guaranteed 
for an arbitrary pair of a complex symmetric and a (general) Hermitian matrix. 
However, existence of SUT implies the applicability of PUT on an arbitrary pair 
of positive definite Hermitian and complex symmetric matrix. In other words, 
PUT can be considered as a generalization of SUT. 

Corollary 1. For an arbitrary pair of one Hermitian positive definite and one 
non-singular complex symmetric matrix, a PUT matrix always exists. 

Finally, we characterize the applicability of PUT as an effective BSS tech- 
nique. Recall the complex linear BSS model as in ([6]), let t := [ti, . . . , tm]^ e M™ 
represent m time instances of individual observations, and denote by Cs(t) and 
Rs{t) the autocorrelation and pseudo-autocorrelation matrix of the sources s(t), 
respectively. Their (i, j)-th entries are computed as 



and 



Cs{t)] ■.= E[s,{t,)s*{tj)], 



Rsit)\ ■.= E[s,{t,)s,{t,)]. 



(40) 
(41) 



Corollary 2. // one of the two conditions: 

(i) 3?(a(t)) (Rs{t^ ^ 5R(a(<)) {Rsit^ 

\ / a \ / a V ^ ii ^ ^ 



(ii) 3 a(t 



Rsit, 



is fulfilled for all pairs {i,j) with i ^ j, then the joint diagonalizer of one 
Hermitian and one complex symmetric matrix, constructed correspondingly from 
the observations w{t) via PUT, is essentially unique and solves the BSS problem 
up to permutation and scaling. 



5 Conclusions 

In this work, we study the problem of simultaneously diagonalizing a set of 
complex square matrices, and provide a thorough uniqueness analysis of the 
problem. In particular, we focus on its application in the problem of complex 
linear BSS. Our work not only characterizes a general result on identifiabil- 
ity conditions of the MJD based BSS methods, but also derives a generalized 
algebraic BSS solution, i.e. the PUT algorithm. Furthermore, the present re- 
sults may also have impact in the areas of beamforming |52) , and direction of 
arrival estimation [55] . where matrix joint diagonalization approaches play an 
important role. 
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.1 Proof of Theorem [T] 

(a) First, consider the case m — 2 and let 



X = 



Xi X2 
X3 Xi 



e G/(2). 



(42) 



Then X^n.X* is diagonal for i = 1, 



for i = 1. 



,n, if and only if 

UJ*^XlX2 + L0*2X'iXi = 0, (43) 

, n. The corresponding system of linear equations reads as 



Wll 


W21 . 


• W„i 


H 


a;iX2 


Wl2 


W22 


• W„2 




0:3X4 



= 0, 



(44) 



which only has a unique trivial solution if and only if the coefficient matrix 
has rank 2. This is equivalent to p(rii, . . . , f2n) < 1- The trivial solution, i.e. 
X1X2 = x-iXi = 0, together with the invertibility of X yields that either xi = 
and Xi = 0, or, X2—^ and x^ — 0. This, in turn, is equivalent to X € ^/(2). 

Consider now the case m > 2. If p = 1 then there exists a pair (fc, V) such 
that |c(u;i;, = 1 and the same argument as above shows that p = 1 implies 
the non-uniqueness of the joint diagonalizer. For the reverse direction of the 
statement, assume that the joint diagonalizer X is not in Q(m). We have to 
show that this implies p = 1. 

Now assume first that one of the ili's, say fii, is invertible. Then 



(XH)- 



(45) 



for i = 1, . . . ,n, gives the simultaneous eigendecomposition of the diagonal 
matrices ri^rij^^. Since X ^ Q{ra), there exists a pair (A, 2) with k ^ I such 
that 

— = — , (46) 

Wife UJil 

which is equivalent to \c{u3ki'^i)\ — 1 and hence p(f2i, . . . , J7„) = 1. If 
all the Sli's are singular, we distinguish between two cases. Firstly, assume 
that there is a position on the diagonals, say k, where all ujik — 0. Then 
|c(a;fe,a;;)| = 1 holds true for any k 7^ I and thus p = 1- Secondly, if there 
is no common position where all the fi^'s have a zero entry, there exists an 
invertible linear combination, say which can also be diagonalized via the 
same transformations. Then by considering a new set {5^i}"=0' same 
argument as from (45 1 to (46) for the invertible case applies by replacing 17 1 
with VLq. This completes the proof for part (a). 

(b) For TO = 2, the condition that X^VliX is diagonal for all z = 1, ... ,71 
leads to the system of linear equations 









T 






W21 ■ 








W12 


W22 


■ ^n2 




X2Xl 



0, 



(47) 



which admits a non-trivial solution if and only if p(r^i, . . . , ^n) — 1- Now, 
4 = together with the invertibility of X implies that X is essen- 
tially unique. The case for to > 2 is now just as in Section [T] and is omitted 
here. 
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.2 Proof of Theorem! 



We prove an equivalent formulation of Theorem [2j Namely, a matrix X G 
Gl{m) \ Q{m) that fulfills condition (24) exists, if and only if there exists a 
pair (fc, I) with k ^ I such that 



|wifc||w2i| = |wi;||w2fc|. 



(48) 



Firstly, consider the case m = 2. From Equations (43) and (47) we see that 



the condition ( 24 ) is equivalent to 



Wiia;ia::2 + 1^12X3X4 

ijJ2lXlX*2 + UJ22X3XI 



(49) 



Assume now that X G Gl{2) \ G{2) and, without loss of generality |a;ia;2| 7^ 0. 



Then either |a;3X4| 7^ and Equation (49) yields 



1^^121 



\X3X4\ 
\X1X2\ 



IW22I 



\X3X4[ 
\X1X2\ 



(50) 



or |a;3a;4| = 0. Both cases imply Equation (48) 



For the other direction, let Equation ( 48 ) hold true 
a common diagonalizer in Gl(2)\Q{2) 



We construct explicitly 
The case when either 17i = or f22 = 



is trivial and not further discussed. Equation (48) implies 



exp(i(/Ji) 



exp (i(/?2) 



(51) 



with suitable ipi G [0, 2tt) and r > 0. Firstly, assume that one, and hence both, 
matrices fii and ft2 are not invertible. We choose without loss of generality 
a;22 to be 0. Equation (49) now implies X1X2 — 0, but 0:3 and X4 can be chosen 



arbitrarily. Indeed, it is easily checked that in this case, 



X := 



1 1 
1 



(52) 



is a common diagonalizer. Assume now that both, fli and Q2 are invertible. 
Then it is straightforwardly verified that 



X := en: 



-1/2 



exp (-5(^1) 



exp (-5(^32) 



(53) 



is a common diagonalizer for any real orthogonal matrix Q G 0(2). 
Now, let m > 2. If Equation (48) holds true, then the case for m 



applies and the diagonalizer is not essentially unique. 

For the reverse direction, we assume firstly that both fli and ^2 are not 
invertible. Then either there exists an index pair (k, I) with k ^ I, such 



that Equation (48) holds true (with zeros on both sides of the equation) and 
it follows again from the case m — 2 that the diagonalizer is not essentially 
unique. Or, Qi and ^2 both have at most one zero diagonal entry at different 
positions. This case will be treated at the end of the proof. 

Let us now consider the case where ^2 is invertible. Assume that the 



diagonalizer is not essentially unique, i.e. that X in Equation (24) (and hence 
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and X*) differs from a product of a diagonal and a permutation matrix. 
The uniqueness of the Q-R-decomposition of the invertible matrix X, i.e. X = 
QxRx, aUows by further decomposing Rx — DxNx with Dx ■— ddiag(i?x) 
and Nx '■= D^Rx the unique factorization 



X = QxDxNx 



(54) 



with unitary Qxi positive real diagonal Dx^ and Nx being upper triangular 
with ddiag(A^x) = Im- Here, ddiag(iVx) forms a diagonal matrix, whose 
diagonal entries are just those of Nx- 

Using this decomposition, X is not in Q (to) if and only if either Nx 7^ In 
or Qx is not a product of a permutation matrix and a diagonal phase shift 
matrix. By a diagonal phase shift matrix, we mean all diagonal matrices in 
[/(to). Using the invertibility assumption on VI2; 



Z : = {X^VL2X)-^X^niX* 

= N^'D],'Q''xn^'niQ*xD*xN*x 

is diagonal. This yields 

D^'Q"x^2'^iQ*xD*x = NxZ{N*x)-^ 



(55) 



(56) 



where the matrix is symmetric on the left hand side and upper triangular on 
the right hand side. This leads us to two conclusions, namely that 



and 



NxZ{N*xy^ is diagonal 



D^^Q^n:^^niQxD*x is diagonal. 



Since Dx = D*j^ is real and diagonal, the last Equation implies that 



R = (5x^2 ^^iQ*x is diagonal 



(57) 
(58) 

(59) 



and hence 



R = NxZ[N*x)- 



Let us have a closer look at Equation (59 1, 
diagonal phase shift matrices $1 and $2 we have 



(60) 

By introducing suitable 



'^lQx^2 



I<^lm/W2 



(61) 



where R is diagonal with real and nonnegative entries. Note that Equation ( 61 ) 
gives a Takagi factorization of R. If Qx differs from a product of a permutation 
matrix and a phase shift matrix, the uniqueness of the Takagi factorization now 
implies that (at least) two diagonal entries have to coincide and consequently 
Equation (48) follows. 

Assume now that Nx differs from the identity, and let its {k, l)-ih entry, 
say z, differ from 0. Now R — ^'{^R and consequently Equation (60) yields 



{Nxy^'^*iRNx = Z. 



(62) 
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Note that, by the special structure of Nx, namely upper triangular with ones 
on the diagonal, this immediately implies Z = Thus, the (fc, /)-th entry 

of equation ( 62 1 reads as 



(63) 



Taking absolute values, this implies \Rkk\ — \Rii \ for the corresponding diagonal 
entries of R and Equation (48) follows. 

Now let us get back to the case where exactly one diagonal entry of ^2 
is zero and the corresponding diagonal entry of fli differs from zero. Since 

(64) 

HiX^nanJrJanznJxn^ is diagonal, (65) 



Equation (24) is equivalent to 

niX"U2U]niU2nlX*Ul is diagonal 

and 



for any permutation matrices 111,112, we assume without loss of generality 
that 



X 



X 



,^1 














M2 



' ^2 


" 









where uJm and X, 0.2 G Gl{m — 1) 
X"n2X = 



Now 



x"n2X 


x"n2Xi 


•k 





(66) 



(67) 



and Equation ( 24 ) together with the invertibility assumption on X and 1^2 
implies Xi = and x 0. Thus 

X^fllX* = -^"""1"^ -r LUm'-V2'-V2 I ^gg^ 



and since x ^ and uim ^ 0, Equation (24) yields that X2 ~ 0. Hence, we 
just showed that if Qi and 5^2 are structured as in Equation (66), X can only 
be a common diagonalizer if 



X 



' X 


" 





X 



(69) 



Now, it is clear that X G Gl{m) \ Q{m) if and only if X G Gl{m — 1) \ tj(rn — 
1), and we reduced the problem to the invertible case treated above. This 
concludes the proof of the theorem. 

.3 Proof of Theorem [3] 



Again, we firstly consider the case m — 2. Assumption (26) is equivalent to 
Condition (27a) and due to the fact that Equations (44) and (47) have both 
nontrivial solutions, say 

[xiX2,X3X4]^ and [xiX2, xsxl]^ . (70) 
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Thus, we have 

X\X*2lji\ 

and, by taking absolute values, 



x:iX^ijj2 = 0, 

- X2,xlu32 = 



|xia;2|||u;i|| = |a;3a;4|||u;2| 
|a;ia;2l||'^i|| = |x3X4|||a;2[ 



and Condition (27b) follows. 

To see the reverse direction, let Condition ( 27b[ ) hold true. 



If 



(71) 



(72) 



||u)2|| = or if = ||u;2|| = 0, the non-uniqueness of X follows from 

Theorem [l] Otherwise, (ii) implies (after a possible renumeration) 



UJ2 = rc^^^(jJi 



(73) 



with r > and ipi g [0, 27r). Using Equation (71), we find an explicit 
diagonalizer that is not in Gim), namely 



xi = rexp(i(/?i), X2 - 
X3 = =exp(i(^i), X4 



-1, 



(74) 



where r ^ can be chosen arbitrarily such that X is invertible. 

Let us consider now the case m > 2. If there exists a pair {k, I) with k I, 



such that Conditions (27) hold, then we can use the above argument for the 



corresponding (2 x 2)-sub matrix and conclude that the common diagonalizer is 
not essentially unique. Now, let X G Gl{m) \ Q{m). Assume for the moment 
that at least one per CiS and Cj's is invertible, say, Ci and C(. This implies 

that C^C'^C^^C'f^ = X^VLiVl'p-^n~^X-^, for i = 1, . . . , s and j = 1, . . . , /i, 
is a simultaneous eigendecomposition. Since X G Gl{m) \ Q{m), there must 
be an index pair (fc, I) with k ^ I, such that 



(75) 



for alH = 1, . . . , s and j = 1, . . . ,h. This yields |c(a;fc, a;;)| = |c(u;^, a;j)| — 1 
and hence Equation (i) follows. If none of the Ci is invertible, the same argu- 
ment as in Theorem l| applied to both sets {Cijf^^ and {C'j}'-^i individually 
yields the same conclusion as in (27a). 

Hence, by permuting k and I if necessary, there exist zi, Z2 G C such that 



= 2:10;;, and wj,, = Z2U}'i. 
On the other hand, by Theorem [2j we obtain 

Iwifcll^,! = |^^^^|l^fcl 



(76) 



(77) 



for all i = 1, . . . , s and j = 1, . . . ,h, and hence \zi\ — \z2\. Equation (76) now 



yields Equation (27b) and the proof is complete. 
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.4 Proof of Lemma [T] 



(i) The construction of C2 as in Equations (36) and (37 1 implies 



An2A^ = UT.U~^. (78) 



As diagonal entries of S are all positive, Equation (78 1 is equivalent to 



By inserting fl2 — {fl2^^)'^ into the above equation, it can be seen that V :— 
j.-i/2[/H^^i/2 complex orthogonal. Now, A = UT^^^^Vn^^^^ , and thus 



Equations ( 36 1 and ( 37 1 yield 



= :A 



where A is diagonal. Then, Equation (80 1 is equivalent to 



■£-i/^U^C\U^-^/^ = VAV^. (81) 
(ii) It is straightforward to verify that 

A"CiA: = V~^T,-^^^U"CiUT,-^^^V* = A, (82) 

and 

A^CaA* = V^Y,-^/'^U"C2U*J:-^/^V = /„. (83) 

References 

[1] A. Hyvarinen, J. Karhunen, and E. Oja, Independent Component Analysis. 
New York: Wiley, 2001. 

[2] P. Comon, "Independent component analysis, a new concept?" Signal Pro- 
cessing, vol. 36, no. 3, pp. 287-314, 1994. 

[3] F. J. Theis, "Uniqueness of complex and multidimensional independent 
component analysis," Signal Processing, vol. 84, no. 5, pp. 951-956, 2004. 

[4] , "A new concept for separability problems in blind source separation," 

Neural Computation, vol. 16, no. 9, pp. 1827-1850, 2004. 

[5] A. Hyvarinen, "Fast and robust fixed-point algorithms for independent 
component analysis," IEEE Transactions on Neural Networks, vol. 10, 
no. 3, pp. 626-634, 1999. 

[6] H. Shen, M. Kleinsteuber, and K. Hiiper, "Local convergence analysis of 
FastICA and related algorithms," IEEE Transactions on Neural Networks, 
vol. 19, no. 6, pp. 1022-1032, 2008. 

[7] H. Shen, K. Hiiper, and M. Kleinsteuber, "On FastICA algorithms and 
some generalisations," in Numerical Linear Algebra in Signals, Systems 
and Control, ser. Lecture Notes in Electrical Engineering, P. Van Dooren, 
S. P. Bhattacharyya, R. H. Chan, V. Olshevsky, and A. Routray, Eds. 
Springer Netherlands, 2011, vol. 80, pp. 403-432. 



19 



[8] D.-T. Pham, "Mutual information approach to blind separation of station- 
ary sources," IEEE Transactions on Information Theory, vol. 48, no. 7, pp. 
1935-1946, 2002. 

[9] F. Vrins, D.-T. Pham, and M. Verleysen, "Mixing and non-mixing local 
minima of the entropy contrast for blind source separation," IEEE Trans- 
actions on Information Theory, vol. 53, no. 3, pp. 1030-1042, 2007. 

[10] R. Boscolo, H. Pan, and V. P. Roychowdhury, "Independent component 
analysis based on nonparametric density estimation," IEEE Transactions 
on Neural Networks, vol. 15, no. 1, pp. 55-65, 2004. 

[11] H. Shen, S. Jegelka, and A. Gretton, "Fast kernel-based independent com- 
ponent analysis," IEEE Transactions on Signal Processing, vol. 59, no. 9, 
pp. 3498 -3511, 2009. 

[12] N. Murata, S. Ikeda, and A. Ziehe, "An approach to blind source separation 
based on temporal structure of speech signals," Neurocomputing, vol. 41, 
no. 1-4, pp. 1-24, 2001. 

[13] A. Hyvarinen, "Blind source separation by nonstationarity of variance: 
A cumulant-based approach," IEEE Transactions on Neural Networks, 
vol. 12, no. 6, pp. 1471-1474, 2001. 

[14] J.-F. Cardoso and A. Souloumiac, "Blind beamforming for non Gaussian 
signals," The lEE Proceedings of F, vol. 140, no. 6, pp. 363-370, 1993. 

[15] J.-F. Cardoso, "On the performance of orthogonal source separation algo- 
rithms," in Proceedings of the 9*'* European Signal Processing Conference, 
1994, pp. 776-779. 

[16] A. Yercdor. "Non-orthogonal joint diagonalization in the least-squares sense 
with application in blind source separation," IEEE Transactions on Signal 
Processing, vol. 50, no. 7, pp. 1545-1553, 2002. 

[17] A. Souloumiac, "Joint diagonalization: Is non-orthogonal always preferable 

to orthogonal?" in Proceedings of the S'"'* IEEE International Workshop on 
Computational Advances in Multi-Sensor Adaptive Processing, 2009, pp. 
305-308. 

[18] D.-T. Pham, "Joint approximate diagonalization of positive definite Hermi- 

tian matrices," SIAM Journal on Matrix Analysis and Applications, vol. 22, 
no. 4, pp. 1136-1152, 2001. 

[19] A. Ziehe, P. Laskov, G. Nolte, and K.-R. Miiller, "A fast algorithm for joint 
diagonalization with non-orthogonal transformations and its application to 
blind source separation," Journal of Machine Learning Research, vol. 5, 
pp. 777-800, July 2004. 

[20] A. Souloumiac, "Nonorthogonal joint diagonalization by combining Givens 
and hyperbolic rotations," IEEE Transactions on Signal Processing, vol. 57, 
no. 6, pp. 2222-2231, 2009. 



20 



[21] D.-T. Pham and J.-F. Cardoso, "Blind separation of instantaneous mix- 
tures of nonstationary sources," IEEE Transactions on Signal Processing, 
vol. 49, no. 9, pp. 1837-1848, 2001. 

[22] B. Afsari, "Sensitivity analysis for the problem of matrix joint diagonal- 
ization," SIAM Journal of Matrix Analysis and Application, vol. 30, no. 3, 
pp. 1148-1171, 2008. 

[23] K. Abed-Meraim, Y. Xiang, J. H. Manton, and Y. Hua, "Blind source ssep- 
aration using second-order cyclostationary statistics," IEEE Transactions 
on Signal Processing, vol. 49, no. 4, pp. 694-701, 2001. 

[24] W.-J. Zeng, X.-L. Li, X.-D. Zhang, and X. Jiang, "An improved signal- 
selective direction finding algorithm using second-order cyclic statistics," 
in Proceedings of the 34*'* IEEE International Conference on Acoustics, 
Speech and Signal Processing, 2009, pp. 2141-2144. 

[25] A. Belouchrani, K. Abed-Meraim, M. G. Amin, and A. M. Zoubir, "BUnd 
separation of nonstationary sources," IEEE Signal Processing Letters, 
vol. 11, no. 7, pp. 605-608, 2004. 

[26] L. Tong, V. C. Soon, Y. F. Huang, and R. Liu, "AMUSE: A new blind iden- 
tification algorithms," in Proceedings of IEEE International Symposium on 
Circuits and Systems, vol. 3, 1990, pp. 1784-1787. 

[27] L. Tong, R.-w. Liu, V. C. Soon, and Y.-F. Huang, "Indeterminacy and 
identifiability of blind identification," IEEE Transactions on Circuits and 
Systems, vol. 38, no. 5, pp. 499-509, 1991. 

[28] A. Aissa-El-Bey, K. Abed-Meraim, Y. Grenier, and Y. Hua, "A general 
framework for second-order blind separation of stationary colored sources," 
Signal Processing, vol. 88, no. 9, pp. 2123-2137, 2008. 

[29] L. de Lathauwer, B. de Moor, and J. Vandcwallc, "Independent compo- 
nent analysis and (simultaneous) third-order tensor diagonalization." IEEE 
Transactions on Signal Processing, vol. 49, no. 10, pp. 2262 2271, 2001. 

[30] J.-F. Cardoso, "Source separation using higher order moments," in Pro- 
ceedings of the 13*^* IEEE International Conference on Acoustics, Speech, 
and Signal Processing, 1989, pp. 2109-2112. 

[31] A. K. Nandi and V. Zarzoso, "Fourth-order cumulant based blind source 
separation," IEEE Signal Processing Letters, vol. 3, no. 12, pp. 312-314, 
1996. 

[32] L. de Lathauwer and B. de Moor, "On the blind separation of non-circular 
sources," in Proceedings of the 11*^ European Signal Processing Conference, 
2002, pp. 99-102. 

[33] J. Eriksson and V. Koivunen, "Complex-valued ICA using second order 
statistics," in Proceedings of the 14*'' IEEE International Workshop on 
Machine Learning for Signal Processing, 2004, pp. 183-191. 



21 



[34] T. Trainini, X.-L. Li, E. Moreau, and T. Adah, "A relative gradient algo- 
rithm for joint decompositions of complex matrices," in Proceedings of the 
18*'' European Signal Processing Conference, 2010, pp. 1073-1076. 

[35] H. Shen and M. Kleinsteuber, "Complex blind source separation via si- 
multaneous strong uncorrclating transform," in Lecture Notes in Computer 
Science, Proceedings of the d*'^ International Conference on Latent Variable 
Analysis and Signal Separation, vol. 6365. Berlin/Heidelberg: Springer- 
Verlag, 2010, pp. 287-294. 

[36] H. Shen and K. Hiiper, "Block Jacobi-type methods for non-orthogonal 
joint diagonalisation," in Proceedings of the 34*'* IEEE International Con- 
ference on Acoustics, Speech, and Signal Processing, 2009, pp. 3285 3288. 

[37] M. B. Priestley, Non-linear and Non-stationary Time Series Analysis. Aca- 
demic Press, 1988. 

[38] P. O. Amblard, M. Gaeta, and J. L. Lacoume, "Statistics for complex 
variables and signals - part ii: Signals," Signal Processing, vol. 53, no. 1, 
pp. 15 25, 1996. 

[39] P. Bondon, P. L. Combettes, and B. Picinbono, "Volterra filtering and 
higher order whiteness," IEEE Transactions on Signal Processing, vol. 43, 
no. 9, pp. 2209-2212, 1995. 

[40] X.-L. Li and T. Adah, "Blind separation of noncircular correlated sources 
using Gaussian entropy rate," IEEE Transactions on Signal Processing, 
vol. 59, no. 6, pp. 2969-2975, 2011. 

[41] P. Comon and C. Jutten, Eds., Handbook of Blind Source Separation: In- 
dependent Component Analysis and Applications. Academic Press Inc., 
2010. 

[42] X.-L. Li, T. Adah, and M. Anderson, "Joint blind source separation by 
generalized joint diagonalization of cumulant matrices," Signal Processing, 
vol. 91, no. 10, pp. 2314-2322, 2011. 

[43] J. M. Mendel, "Tutorial on higher-order statistics (spectra) in signal pro- 
cessing and system theory: Theoretical results and some applications," 
Proceedings of the IEEE, vol. 79, no. 3, pp. 278-305, 1991. 

[44] A. Belouchrani, K. A. Meraim, J.-F. Cardoso, and E. Moulines, "A bhnd 
source separation technique based on second order statistics," IEEE Trans- 
actions on Signal Processing, vol. 45, no. 2, pp. 434-444, 1997. 

[45] L. Parra and P. Sajda, "Blind source separation via generalized eigenvalue 
decomposition," The Journal of Machine Learning Research, vol. 4, no. 7-8, 
pp. 1261-1269, 2004. 

[46] E. Ollila and V. Koivunen, "Complex ICA using generalized uncorrelating 
transform," Signal Processing, vol. 89, no. 4, pp. 365-377, 2009. 

[47] J. Eriksson and V. Koivunen, "Complex random vectors and ICA mod- 
els: Identifiability, uniqueness, and separability," IEEE Transactions on 
Information Theory, vol. 52, no. 3, pp. 1017-1029, 2006. 



22 



[48] R. Bcncdetti and P. Cragnolini, "On simultaneous diagonalization of one 
Hermitian and one symmetric form," Linear Algebra and its Applications, 
vol. 57, pp. 215-226, February 1984. 

[49] A. Yeredor, "Performance analysis of the strong uncorrelating transforma- 
tion in blind separation of complex-valued sources," IEEE Transactions on 
Signal Processing, vol. 60, no. 1, pp. 478-483, 2012. 

[50] M. Kleinsteuber, "A sort-Jacobi algorithm for semisimple Lie algebras," 
Linear Algebra and its Applications, vol. 430, no. 1, pp. 155-173, 2009. 

[51] H. Shen and M. Kleinsteuber, "Algebraic solutions to complex blind source 
separation," in Proceedings of the lO*'' International Conference on Latent 
Variable Analysis and Signal Separation, ser. Lecture Notes in Computer 
Science, F. Theis, A. Cichocki, A. Yeredor, and M. Zibulevsky, Eds., vol. 
7191. Springer Berlin/Heidelberg, 2012, pp. 74-81. 

[52] X. Huang, H.-C. Wu, and J. C. Principe, "Robust blind beamforming algo- 
rithm using joint multiple matrix diagonalization," IEEE Sensors Journal, 
vol. 7, no. 1, pp. 130-136, 2007. 

[53] W.-J. Zeng, X.-L. Li, and X.-D. Zhang, "Direction-of-arrival estimation 
based on the joint diagonalization structure of multiple fourth-order cumu- 
lant matrices," IEEE Signal Processing Letters, vol. 16, no. 3, pp. 164-167, 
2009. 



23 



